SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Boyd, Michael R. 

Gustafson, Kirk R. 
Shoemaker, Robert H. 
McMahon, James B. 

(ii) TITLE OF INVENTION: ANTIVIRAL PROTEINS AND PEPTIDES, DNA 
CODING SEQUENCES THEREFOR, AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Leydig, Voit & Mayer, Ltd. 

(B) STREET: Two Prudential Plaza, Suite 4900 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: U.S.A. 

(F) ZIP: 60601-6780 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 2 7 -APR- 1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Larcher, Carol 

(B) REGISTRATION NUMBER: 35243 

(C) REFERENCE /DOCKET NUMBER: 61037 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (312)616-5600 

(B) TELEFAX: (312)616-5700 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 10.. 312 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CGATCGAAG CTT GGT AAA TTC TCC CAG ACC TGC TAC AAC TCC GCT ATC 
Leu Gly Lys Phe Ser Gin Thr Cys Tyr Asn Ser Ala lie 
15 10 



CAG GGT TCC GTT CTG ACC TCC ACC TGC GAA CGT ACC AAC GGT GGT TAC 
Gin Gly Ser Val Leu Thr Ser Thr Cys Glu Arg Thr Asn Gly Gly Tyr 
15 20 25 



2 



AAC ACC TCC TCC ATC GAC CTG AAC TCC GTT ATC GAA AAC GTT GAC GGT 144 
Asn Thr Ser Ser lie Asp Leu Asn Ser Val lie Glu Asn Val Asp Gly 
30 35 40 45 

TCC CTG AAA TGG CAG CCG TCC AAC TTC ATC GAA ACC TGC CGT AAC ACC 192 
Ser Leu Lys Trp Gin Pro Ser Asn Phe lie Glu Thr Cys Arg Asn Thr 
50 55 60 

CAG CTG GCT GGT TCC TCC GAA CTG GCT GCT GAA TGC AAA ACC CGT GCT 24 0 

Gin Leu Ala Gly Ser Ser Glu Leu Ala Ala Glu Cys Lys Thr Arg Ala 
65 70 75 

CAG CAG TTC GTT TCC ACC AAA ATC AAC CTG GAC GAC CAC ATC GCT AAC 28 8 

Gin Gin Phe Val Ser Thr Lys lie Asn Leu Asp Asp His lie Ala Asn 
80 85 90 

ATC GAC GGT ACC CTG AAA TAC GAA TAACTCGAGA TCGTA 32 7 

lie Asp Gly Thr Leu Lys Tyr Glu 
95 100 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE :' protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Leu Gly Lys Phe Ser Gin Thr Cys Tyr Asn Ser Ala lie Gin Gly Ser 
15 10 15 

Val Leu Thr Ser Thr Cys Glu Arg Thr Asn Gly Gly Tyr Asn Thr Ser 
2 0 2 5 3 0 

Ser lie Asp Leu Asn Ser Val lie Glu Asn Val Asp Gly Ser Leu Lys 
35 40 45 

Trp Gin Pro Ser Asn Phe lie Glu Thr Cys Arg Asn Thr Gin Leu Ala 
50 55 60 

Gly Ser Ser Glu Leu Ala Ala Glu Cys Lys Thr Arg Ala Gin Gin Phe 
65 70 75 80 

Val Ser Thr Lys lie Asn Leu Asp Asp His lie Ala Asn lie Asp Gly 
85 90 95 

Thr Leu Lys Tyr Glu 
100 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. . 327 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GAC TAC AAG GAC GAC GAT GAC AAG CTT GGT AAA TTC TCC CAG ACC TGC 48 
Asp Tyr Lys Asp Asp Asp Asp Lys Leu Gly Lys Phe Ser Gin Thr Cys 
15 10 15 

TAC AAC TCC GCT ATC CAG GGT TCC GTT CTG ACC TCC ACC TGC GAA CGT 96 
Tyr Asn Ser Ala lie Gin Gly Ser Val Leu Thr Ser Thr Cys Glu Arg 
20 25 30 

ACC AAC GGT GGT TAC AAC ACC TCC TCC ATC GAC CTG AAC TCC GTT ATC 
144 

Thr Asn Gly Gly Tyr Asn Thr Ser Ser lie Asp Leu Asn Ser Val lie 
35 40 45 

GAA AAC GTT GAC GGT TCC CTG AAA TGG CAG CCG TCC AAC TTC ATC GAA 192 
Glu Asn Val Asp Gly Ser Leu Lys Trp Gin Pro Ser Asn Phe lie Glu 
50 55 60 

ACC TGC CGT AAC ACC CAG CTG GCT GGT TCC TCC GAA CTG GCT GCT GAA 240 
Thr Cys Arg Asn Thr Gin Leu Ala Gly Ser Ser Glu Leu Ala Ala Glu 
65 70 75 80 

TGC AAA ACC CGT GCT CAG CAG TTC GTT TCC ACC AAA ATC AAC CTG GAC 2 88 

Cys Lys Thr Arg Ala Gin Gin Phe Val Ser Thr Lys lie Asn Leu Asp 
85 90 95 

GAC CAC ATC GCT AAC ATC GAC GGT ACC CTG AAA TAC GAA 327 
Asp His lie Ala Asn lie Asp Gly Thr Leu Lys Tyr Glu 
100 105 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Asp Tyr Lys Asp Asp Asp Asp Lys Leu Gly Lys Phe Ser Gin Thr Cys 
1 5 10 15 

Tyr Asn Ser Ala lie Gin Gly Ser Val Leu Thr Ser Thr Cys Glu Arg 
20 25 30 

Thr Asn Gly Gly Tyr Asn Thr Ser Ser lie Asp Leu Asn Ser Val lie 
35 40 45 

Glu Asn Val Asp Gly Ser Leu Lys Trp Gin Pro Ser Asn Phe lie Glu 
50 55 60 

Thr Cys Arg Asn Thr Gin Leu Ala Gly Ser Ser Glu Leu Ala Ala Glu 
65 70 75 80 

Cys Lys Thr Arg Ala Gin Gin Phe Val Ser Thr Lys lie Asn Leu Asp 
85 90 95 

Asp His lie Ala Asn lie Asp Gly Thr Leu Lys Tyr Glu 
100 105 



